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Abstract 

Using  the  notions  of  Q-heaps  and  fusion  trees  developed  by  Fredman  and  Willard,  we 
develop  a  faster  version  of  the  fractional  cascading  technique  while  maintaining  the  linear 
space  structure.  The  new  version  enables  sublogarithmic  iterative  search  in  the  case  when  we 
have  a  search  tree  and  the  degree  of  each  node  is  bounded  by  O(loge  n),  for  some  constant 
e  >  0,  where  n  is  the  total  size  of  all  the  lists  stored  in  the  tree.  The  fast  fractional 
cascading  technique  is  used  in  combination  with  other  techniques  to  derive  sublogarithmic 
time  algorithms  for  the  geometric  retrieval  problems:  orthogonal  segment  intersection  and 
rectangular  point  enclosure.  The  new  algorithms  use  0(n)  space  and  achieve  a  query  time 
of  O (log  n/  log  log  n  +  /),  where  /  is  the  number  of  objects  satisfying  the  query.  All  our 
algorithms  assume  the  version  of  the  RAM  model  used  by  Fredman  and  Willard. 


1  Introduction 

Fractional  cascading  [5]  is  a  powerful  technique  for  dealing  with  the  so  called  iterative  search 
problem  [5].  Let  G  be  a  graph  such  that  a  sorted  list  is  associated  with  each  of  its  vertices. 
Given  a  key  q  and  a  subgraph  G' ,  the  iterative  search  problem  is  to  search  for  q  in  each  of 
the  lists  associated  with  the  vertices  in  G' .  This  problem  appears  to  be  crucial  in  developing 
efficient  algorithms  for  a  wide  variety  of  problems,  and  in  particular  for  geometric  retrieval 
problems  [4,  6,  17,  3,  7,  24], 

Let  the  overall  size  of  the  lists  be  n,  and  let  the  number  of  vertices  in  the  subgraph  G' 
be  p  such  that  the  degree  of  each  vertex  is  bounded  by  c.  Then  the  fractional  cascading 
structure  enables  searching  for  the  key  q  in  these  lists  in  O(log  n  +  plog  c)  time.  This  result 
is  optimal  under  the  pointer  machine  model  [22],  However,  its  optimality  in  more  powerful 
computational  models,  such  as  the  RAM  model  and  its  variationsfl,  18,  10],  is  not  clear.  In 

*  Supported  in  part  by  the  National  Science  Foundation  through  the  National  Partnership  for  Advanced 
Computational  Infrastructure  (NPACI),  DoD-MD  Procurement  under  contract  MDA90402C0428,  and  NASA 
under  the  ESIP  Program  NCC5300. 


1 


particular,  Fredman  and  Willard  showed  in  [10]  that  such  search  operations  can  be  achieved 
in  worst  case  sublogarithmic  time,  which  is  impossible  on  the  pointer  machine  model.  An 
important  fact  of  the  fractional  cascading  structure  is  that  the  complexity  of  the  search 
operations  on  the  vertices  other  than  the  first  one  is  affected  significantly  by  the  bound  c  on 
the  degree  of  each  vertex  in  G.  We  show  in  this  paper  that,  under  the  RAM  model  used  by 
Fredman  and  Willard,  if  G  is  a  tree  whose  degree  is  bounded  by  a  logarithmic  function  of  n, 
i.e.  c  =  loge  n  for  some  constant  e,  it  is  possible  to  search  each  list,  except  the  one  associated 
with  the  root,  in  constant  time  (independent  of  n)  while  still  maintaining  0(n)  space  for  the 
data  structure.  We  call  this  new  fractional  cascading  structure  fast  fractional  cascading. 

We  believe  that,  by  applying  the  fast  fractional  cascading  technique,  one  should  be  able  to 
improve  the  asymptotic  upper  bounds  of  a  wide  range  of  geometrical  retrieval  problems.  This 
is  due  to  the  fact  that  binary  search  trees  whose  nodes  are  each  equipped  with  sorted  lists 
are  often  the  crucial  components  in  many  data  structures  for  geometrical  retrieval  problems. 
Ensuring  that  a  constant  amount  of  time  is  spent  at  each  node  is  often  critical  in  designing 
these  structures  and  can  be  accomplished  using  the  standard  fractional  cascading  technique. 
The  fusion  tree  technique  [10,  11]  makes  it  possible  to  further  reduce  the  search  complexity 
of  some  of  these  structures  by  increasing  the  degree  of  the  binary  trees  to  loge  n  so  that  the 
height  of  the  tree  is  reduced  to  O(log  n/  log  log  n).  Even  though  at  query  time  the  branch 
operation  at  each  node  can  be  performed  in  constant  time,  the  fact  that  the  degree  of  the  tree 
is  now  dependent  on  n  makes  the  standard  fractional  cascading  technique  ineffective.  In  fact, 
a  search  operation  performed  on  the  list  associated  with  a  tree  node  will  require  O(log  log  n) 
time,  which  negates  the  effect  of  the  “fattened”  tree.  Only  in  some  special  cases,  such  as 
when  the  list  of  a  node  is  a  superset  of  that  of  its  each  children,  constant  search  time  can 
be  achieved  by  equipping  with  each  element  of  the  parent  of  loge  n  pointers  interconnecting 
the  elements  in  the  parent  with  those  of  its  children,  as  Willard  did  in  solving  the  three- 
dimensional  dominance  aggregation  problem  [24],  Even  in  such  cases,  the  storage  cost  is 
increased  by  a  factor  of  loge  n.  On  the  other  hand,  our  fast  fractional  cascading  technique 
achieves  the  constant  time  search  at  each  node  while  maintaining  the  linear  space  constraint. 

We  apply  our  techniques  to  two  well-known  problems:  orthogonal  segment  intersec- 
tion}  and  rectangular  point  enclosure.  Each  of  our  algorithms  achieves  0(n)  space  and 
O(log  n/  log  log  n  +  /)  query  time  where  /  is  the  number  of  objects  satisfying  the  query.  The 
best  previous  results  require  query  time  O(logn  +  /)  when  using  linear  space  [4,  17]. 

We  now  introduce  these  two  problems  formally.  To  facilitate  our  explanation,  we  will 
denote  a  horizontal  (resp.  vertical)  segment  in  a  two-dimensional  space  as  (aq ,x2;y)  (resp. 
(x;  j/i,  j/2)),  where  (aq,2/)  and  (x2}y)  (resp.  (x,y  1)  and  (x}y2))  are  its  two  endpoints  and 
x\  <  x2  (resp.  2/1  <  y2)). 

•  Orthogonal  segment  intersection.  Given  a  set  S  of  n  horizontal  segments,  report  the 
subset  Q  of  segments  that  intersect  a  given  vertical  segment.  We  say  a  horizontal 
segment  (aq ,x2;y)  intersects  a  vertical  segment  (ay  2/1,  y2)  if  and  only  if  x1  <  x  <  x2 
and  2/1  <  y  <  y2.  We  call  the  segments  in  Q  proper  segments  relative  to  the  given 
query. 

•  Rectangular  point  enclosure.  Let  (aq,  x2 ;  y  1,  y2)  denote  a  rectangle  in  a  two-dimensional 
space  with  edges  parallel  to  the  axes,  where  the  intervals  [aq,aq]  and  [221,2/2]  are  the 
projections  of  this  rectangle  to  the  x-axis  and  y-axis  respectively.  Given  a  set  of 
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S  of  rectangles,  report  the  subset  Q  of  proper  rectangles  such  that  each  rectangle 
,r2;  ;yi,  :y2)  ill  Q  contains  a  query  point  (x,y),  i.o>  aq  <  x  <  x2  and  jq  <  y  <  y2. 

In  this  paper,  we  use  the  modified  RAM  model  described  in  [10].  In  this  model,  it  is 
assumed  that  each  word  contains  w  bits,  and  the  size  of  a  data  set  never  exceeds  2“',  i.e. 
w  >  log2  n.  In  addition  to  arithmetic  operations,  bitwise  logical  operations  are  also  assumed 
to  take  constant  time. 

The  next  section  introduces  some  well-known  techniques  that  will  be  heavily  utilized 
in  the  rest  of  the  paper.  In  Section  3,  we  present  the  fast  fractional  cascading  structure, 
while  Sections  4,  and  5  respectively  give  the  improved  algorithms  for  orthogonal  segment 
intersection  and  rectangular  enclosure. 


2  Preliminaries 

Given  a  set  S  of  multi-dimensional  points  (aq,  x2, .  .  tqqq),  a  point  with  the  largest  ay- 
coordinate  smaller  than  or  equal  to  a  real  number  a  is  called  the  Xi-predecessor  of  a  and  the 
one  with  the  smallest  ay-coordinate  larger  than  or  equal  to  a  is  called  the  xt-successor  of  a. 

2.1  Cartesian  Trees 

The  notion  of  a  Cartesian  tree  was  first  introduced  by  Vuillemin  [23]  (and  rediscovered  by 
Seidel  and  Aragon  [20]).  A  Cartesian  tree  is  a  binary  tree  defined  over  a  finite  set  of  2-D 
points  sorted  by  their  x-coordinates,  say  (|q, .  .  .  ,pn).  Let  pt  be  the  point  with  the  largest 
^-coordinate.  Then  pt  is  associated  with  the  root  w  of  C .  The  two  children  are  respectively 
the  root  of  the  Cartesian  trees  built  on  |q, .  .  .  ,p8_ i  and  pt+ 1, .  .  .  ,pn.  Note  that  the  left  (resp. 
right)  child  of  w  does  not  exist  if  i  =  1  (resp.  i  =  n).  Figure  1  shows  an  example  of  the 
Cartesian  tree. 


Figur#  1:  Cartesian  tree. 


An  important  property  of  the  Cartesian  tree  is  given  by  the  following  observation  [12]: 

Observation  2.1.  Consider  a  set  S  of  2-D  points  and  the  corresponding  (x,  y)-Cartesian  tree 
C .  Let ■  oq  <  x-2  be  the  x-coordinates  of  two  points  in  S  and  let  a  and  (3  be  their  respective  nodes 
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in  C .  Then  the  point  with  the  largest  y-coordinate  among  those  points  whose  x-coordinates 
are  between  x1  and  x2  is  stored  in  the  nearest  common  ancestor  of  a  and  fd . 

Using  Observation  2.1,  combined  with  the  techniques  to  compute  the  nearest  common 
ancestors  [13]  (see  also  [2])  in  constant  time,  we  have  shown  in  [21]  that  we  can  handle  the 
so-called  3-sided  two  dimensional  range  queries  efficiently.  Briefly,  a  point  (a,  b )  satisfies  the 
3-sided  query  (aq,  x2,  y),  x1  <  x2,  if  x1  <  a  <  x2  and  b>y. 

Lemma  2.1.  By  preprocessing  a  set  ofn  two-dimensional  points  to  construct  a  (x,y)-Cartesian 
tree  C ,  we  can  handle  any  three-sided  two-dimensional  range  query  given  as  (aq,  x2,  y),  with 
aq  <  x2,  in  0{t(n)  +  f)  time,  where  t(n )  is  the  time  it  takes  to  find  in  C  the  leftmost  and 
right  most  nodes  whose  x-coordinates  fall  within  the  range  [aq,aq]  and  f  is  the  number  of 
points  reported. 

Note  that  C  should  be  transformed  into  a  suitable  form  to  enable  the  computation  of 
nearest  common  ancestors  in  constant  time. 

2.2  Q-heaps  and  Fusion  Trees 

Q-heaps  and  fusion  trees ,  developed  by  Fredman  and  Willard  [10,  11],  achieve  sublogarithmic 
search  time  on  one-dimensional  data.  While  the  Q-heap  data  structure  was  proposed  later 
than  fusion  tree,  it  can  be  used  as  a  building  block  for  the  fusion  tree  [24],  Using  Q-heaps 
and  fusion  trees,  Willard  demonstrated  in  [24]  theoretical  improvements  for  a  number  of 
range  search  problems. 

Q-heap  [11]  supports  insert,  delete,  and  search  operations  in  constant  time  for  small 
subsets  of  a  large  data  set  of  size  n.  Its  main  properties  are  given  in  the  following  lemma 
(the  version  presented  here  is  taken  from  [24]). 

Lemma  2.2.  Suppose  S  is  a  subset  with  cardinality  m  <  log1/5  n  lying  in  a  larger  database 
consisting  ofn  elements.  Then  there  exists  a  Q-heap  data  structure  of  size  0(m)  that  enables 
insertion,  deletion,  member,  and  predecessor  queries  on  S  to  run  in  constant  worst  case  time, 
provided  access  is  available  to  a  precomputed  table  of  size  o(n). 

Note  that  the  look-up  table  of  size  o(n)  referred  to  above  is  shared  by  all  the  Q-heaps 
built  on  subsets  of  this  larger  database. 

The  fusion  tree  built  on  the  Q-heap  achieves  linear  space  and  sublogarithmic  search  time. 
The  following  lemma  is  a  simplified  version  of  Corollary  3.2  from  [24], 

Lemma  2.3.  Assume  that  in  a  database  ofn  elements,  we  have  available  the  use  of  precom¬ 
puted  tables  of  size  o(n).  Then  it  is  possible  to  construct  a  data  structure  of  size  0(n)  space, 
which  has  a  worst-case  time  O(log  n/  log  log  n)  for  performing  member,  predecessor  and  rank 
operations. 

Notice  that  the  assumptions  of  the  RAM  model  introduced  in  Section  1  are  critical  to 
achieving  the  bounds  claimed  in  the  above  lemmas. 
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2.3  Adjacency  Map  and  Hive  Graph 

The  notion  of  the  adjacency  map  was  first  introduced  by  Lipski  and  Preparata  [15].  Given  a 
set  S  of  n  horizontal  segments,  the  vertical  adjacency  map  G(S)  is  constructed  by  intercon¬ 
necting  the  horizontal  segments  in  S  using  vertical  (infinite,  semi-infinite,  or  finite)  segments 
as  follows:  from  each  endpoint  of  the  segments  in  ,5',  draw  two  rays  shooting  upward  and 
downward  respectively  until  they  meet  other  segments  in  S  except  possibly  at  an  endpoint. 
This  creates  a  planar  subdivision  G(S)  with  0(n)  vertices,  which  are  the  joints  of  the  hor¬ 
izontal  and  vertical  segments.  G(S)  can  be  represented  in  0(n)  space  using  the  adjacency 
lists  associated  with  the  vertices.  We  call  the  edges  supported  by  the  horizontal  segments 
horizontal  edges  and  those  supported  by  the  vertical  segments  vertical  edges. 

Chazelle  noticed  in  [4]  that  the  adjacency  map  is  a  useful  tool  which,  when  modified 
appropriately,  can  be  used  to  handle  the  orthogonal  segment  intersection  problem  efficiently. 
His  modification  of  the  vertical  adjacency  map  is  called  the  hive  graph.  A  hive  graph  H(S) 
is  derived  from  G(S)  by  adding  only  vertical  segments  to  G(S)  while  maintaining  0(n) 
vertices  and  0(n)  space  representation.  However,  it  has  the  important  property  that  each 
face  may  have,  in  addition  to  its  four  (or  fewer)  corners,  at  most  two  extra  vertices,  one  on 
each  horizontal  edge.  Figure  2  shows  a  vertical  adjacency  map  and  its  corresponding  hive 
graph,  in  which  the  additional  vertical  edges  are  depicted  as  dashed  lines.  By  assuming  that 
the  endpoints  of  the  segments  in  S  all  have  distinct  x-  and  y-coordinates,  as  [4]  did,  one  can 
conclude  that  each  face  of  H(S)  has  0(1)  vertices  on  its  boundary.  Given  a  query  segment 
(,r;  ;yi,  2/2),  the  segment  intersection  query  can  be  handled  as  follows.  We  first  find  the  face 
in  H(S)  that  contains  the  endpoint  {x,y i)  in  O(log??)  time  by  using  one  of  the  well  known 
planar  point  location  algorithms  [8,  9,  14,  16,  19].  Then  we  traverse  a  portion  of  H(S)  from 
bottom  up  following  the  direction  from  (x,  y i)  to  (x,  y 2).  Only  a  constant  number  of  vertices 
are  visited  between  two  consecutive  encounters  of  the  horizontal  edges  that  intersect  the 
query  segment. 


Figure  2:  A  hive  graph 


Note  that  the  vertical  boundary  of  a  face  of  the  hive  graph  corresponding  to  a  vertical 
adjacency  map  will  not  necessarily  contain  a  constant  number  of  vertices  if  the  assumption 
that  the  endpoints  of  the  segments  in  S  have  distinct  x  coordinates  does  not  hold.  Since 
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we  will  need  to  deal  later  with  such  a  case,  we  get  around  this  problem  by  associating  with 
each  vertex  pointers  to  the  upper-right  or  upper-left  corner  in  the  same  face  as  follows.  We 
modify  H(S)  by  associating  with  each  vertex  [3  two  additional  pointers  p{/3)  and  q{/3).  Let 
(3  =  8\,82,  ■  ■  ■  ,81  =  7  be  the  maximal  chain  of  vertices  such  that  each  pair  of  consecutive 
vertices  6t  and  <5i+i  is  connected  by  a  vertical  edue-f  and  <5i+i  is  above  6t,  for  i  =  1, ...,/  —  1. 
Note  that  this  chain  can  be  empty  (/  =  1)  in  which  case  both  p(/3)  and  q(/3)  are  null.  If  there 
exists  a  vertex  in  {82,83, .  .  .  ,81}  which  has  a  horizontal  edge  connecting  it  to  a  vertex  to 
the  left  of  it,  then  p{/3)  points  to  the  lowest  such  vertex.  Otherwise,  p{/3)  is  null.  Similarly, 
if  there  exists  a  vertex  that  has  a  horizontal  edge  connecting  it  to  a  vertex  to  the  right  of 
it,  then  q{/3)  points  to  the  lowest  such  vertex.  Otherwise,  q{/3)  is  null.  It  is  easy  to  see 
that,  using  these  additional  pointers,  we  can  in  constant  time  reach  the  next  proper  segment 
without  the  distinct  x  coordinates  assumption.  Figure  3  shows  such  a  modified  hive  graph. 
The  additional  pointers  p{/3)  and  q{/3)  are  depicted  respectively  as  dashed  and  dotted  arrows. 
To  simplify  the  drawing,  we  omit  the  pointer  a  =  p{/3)  or  a  =  q{/3)  if  a  is  null  or  {a,  (3) 
is  an  edge  in  H(s).  This  figure  also  illustrates  the  search  path  of  an  exemplary  segment 
intersection  cpiery  by  highlighting  the  pointers  involved. 


Figure  3:  Modified  hive  graph. 

As  noted  in  [4],  when  the  cpiery  segment  is  semi-infinite,  that  is,  consists  of  a  ray 
(x;  —oo,y-2)  shooting  downward,  there  is  no  need  to  perform  the  initial  planar  point  lo¬ 
cation  cpiery.  Instead,  we  can,  during  preprocessing,  sort  the  x-coordinates  of  the  vertical 
edges  of  the  faces  unbounded  from  below,  and  perform  as  the  first  step  at  cpiery  time  a 
search  on  the  sorted  x-coordinates  to  locate  the  face  that  contains  the  point  (x,—oo).  The 
following  lemma  is  a  restatement  of  Corollary  1  in  [4], 

Lemma  2.4.  Given  a  set  S  of  n  horizontal  segments  in  the  plane,  an  0(n)  space  hive  graph 
can  be  used  to  determine  all  the  intersections  of  the  horizontal  segments  with  a  semi-infinite 
vertical  query  segment  s  =  (x;  —oo,y2)  in  0(t(n)  +  /)  time,  where  f  is  the  number  of 
intersections,  and  t(n)  is  the  time  it  takes  to  search  the  sorted  list  of  x-coordinates. 

Combining  Lemmas  2.3  and  2.4,  we  have  the  following  Corollary. 
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Corollary  2.1.  Given  a  set  S  of  n  horizontal  segments  in  the  plane  and  a  vertical  query 
segment  in  the  form  of  (x;  — oo ,y2),  H  is  possible  to  report  all  f  proper  segments  of  S  in 
O(log  n/  log  log  n  +  f)  time  using  0(n )  space. 

Clearly,  by  rotating  the  hive  graph  90°  clockwise  (resp.  counterclockwise),  the  same  type 
of  techniques  will  yield  a  solution  for  handling  any  orthogonal  segment  intersection  query  that 
involves  a  set  S  of  vertical  segments  and  a  horizontal  query  segment  of  the  form  (  —  00,  x2]  y) 
(resp.  (aq,  +00;  y)).  We  will  denote  this  rotated  hive  graph  HL(S)  (resp.  HR(S)). 


3  Fast  Fractional  Cascading 

Suppose  we  have  a  tree  T  =  (V,  E)  rooted  at  w  such  that  each  node  v  has  a  degree  bounded 
by  c  and  contains  a  catalog  L(y)  of  sorted  elements.  Let  n  denote  the  total  number  of 
elements  in  these  catalogs.  A  key  value  k(g)  from  3?  U  {  —  00,  +00}  is  associated  with  each 
element  g  in  L(y).  The  elements  in  L(y)  do  not  need  to  have  distinct  key  values.  We  call 
such  a  tree  a  catalog  tree.  Let  x  be  a  real  number  and  F  be  an  arbitrary  forest  with  p  nodes 
consisting  of  subtrees  of  T  determined  by  some  of  the  children  of  w.  Both  x  and  F  can  be 
specified  online,  i.e.,  not  necessarily  at  preprocessing  time.  Let  (Tl{x )  denote  the  successor 
of  x  in  a  catalog  L.  The  iterative  search  problem  is  defined  as  follows  [5]:  report  (Tl(v){x) 
for  each  u  in  F .  Fractional  cascading  is  a  technique  for  solving  the  iterative  search  problem 
efficiently,  and  is  briefly  introduced  next. 

3.1  Fractional  Cascading 

The  following  lemma  is  a  direct  derivation  from  the  one  given  by  Chazelle  and  Guibas  for 
identifying  the  successor  of  a  value  x  in  each  of  the  catalogs  in  F  [5]. 

Lemma  3.1.  There  exists  a  linear  size  fractional  cascading  data  structure  that  can  be  used 
to  determine  the  successors  of  a  given  value  x  in  the  catalogs  associated  with  F  in  0(p  log  c  + 
t(n))  time ,  where  t(n )  is  the  time  it  takes  to  identify  the  successor  of  x  in  L(w). 

The  main  component  of  a  fractional  cascading  structure  is  the  notion  of  the  augmented 
catalogs.  At  each  node  v  in  T,  in  addition  to  the  original  catalog  L(v),  we  store  another  aug¬ 
mented  catalog  A(v),  which  is  a  superset  of  L(y)  and  contains  additional  copies  of  elements 
from  the  augmented  lists  associated  with  its  parent  and  children.  With  each  element  h  in 
A(v),  we  associate  a  pointer  to  its  successor  (JL(v){h)  in  L(y).  Since  A(v)  is  a  superset  of 
L(v)}  we  have  a L(v){g)  =  ^L(v)(aA(v)(g))-  Note  that  the  elements  in  an  augmented  list  A(v) 
form  a  multiset  S(v),  that  is,  a  single  element  can  appear  multiple  times  in  an  augmented 
list.  The  elements  in  an  augmented  list  are  chained  together  to  form  a  doubly  linked  list. 

As  illustrated  in  Figure  4,  let  u  and  v  be  two  neighboring  nodes  in  T,  u  being  As 
parent.  There  exists  a  subset  B(u}v)  of  A(u)  X  A(v)  such  that,  for  each  pair  of  elements 
(g,h)  G  B(u}v)}  k(g)  =  k(h).  The  pair  of  elements  (g,h)  are  called  a  bridge.  There  is  a 
pointer  to  h  associated  with  the  element  g,  and  similarly  a  pointer  to  g  is  associated  with 
h.  We  will  call  g  a  down-bridge ,  and  h  an  up-bridge ,  associated  with  the  edge  (u,v).  It  is 
important  to  point  out  that  each  element  in  an  augmented  list  can  serve  as  at  most  one 
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up-bridge  or  one  down-bridge,  but  not  both.  Bridges  respect  the  ordering  of  equal-valued 
elements  and  thus  do  not  “cross”.  This  guarantees  that  B(u,v)  can  be  ordered  and  the 
concept  of  gap  presented  next  is  well  defined.  In  this  ordered  set  B(u,v),  the  bridge  (g,h) 
appears  after  the  (</',  h')  if  and  only  if  g  appears  after  g'  in  A(u).  A  gap  &'(„,„)(</,  h)  of  bridge 
(g,h)  is  defined  as  the  multiset  of  elements  from  both  A(u)  and  A(u)  which  are  strictly 
between  two  bridges  (g,h)  and  (g',  h'),  where  (g',  h')  is  the  bridge  that  appears  immediately 
before  (g,h)  in  B(u,v).  Accordingly,  we  define  the  up-gap  (resp.  down-gap)  G(u>v)(g)  (resp. 
G(i i,v){h)]  as  the  subset  of  G(u>v)(g,  h)  containing  elements  from  A(u)  (resp.  A(u)),  preserving 
their  orders  in  the  respective  augmented  catalogs. 


Figure  4:  Fractional  cascading. 

The  fractional  cascading  structure  maintains  the  invariant  that  the  size  of  any  gap  cannot 
exceed  6c  —  1.  Chazelle  and  Guibas  provided  in  [5]  an  algorithm  that  can  in  0(n)  time 
construct  such  a  data  structure;  and  they  prove  that  it  requires  0(n)  space. 

Given  a  parent-child  pair  (u,v)  £  A,  suppose  we  know  the  successor  cy4(u)(,r)  of  a  value 
x  in  A(u),  we  follow  A(  w)  along  the  direction  of  increasing  values  to  the  next  down-bridge 
g  connecting  u  and  v  (it  could  be  cy4(u)(,r)  itself  if  it  is  a  down-bridge),  cross  it  to  its 
corresponding  up-bridge  h,  and  scan  A(u)  in  the  opposite  position  until  the  successor  of  x 
in  A(u)  is  encountered.  Clearly,  cy4(u)(,r)  is  guaranteed  to  be  found  by  this  process.  The 
constraint  on  the  gap  size  ensures  that  the  number  of  comparison!  required  is  0(c). 

When  c  is  a  constant,  the  above  result  is  optimal.  When  c  is  large,  Chazelle  and  Guibas 
used  the  so  called  star  tree  to  achieve  O(logc)  search  time  on  each  catalog  except  the  one 
stored  at  the  root. 

3.2  Fast  Fractional  Cascading 

The  fractional  cascading  structure  described  above  is  strictly  list  based,  and  hence  all  the 
related  algorithms  can  run  on  a  pointer  machine  within  the  complexity  bounds  stated.  Us¬ 
ing  the  variation  of  the  RAM  model  introduced  in  Section  1  and  the  Q-heap  technique 
of  Frechnan  and  Willard  [11],  summarized  in  Section  2.2,  we  can  achieve  constant  search 
time  (independent  of  c)  per  node  for  the  class  of  catalog  trees  whose  degree  is  bounded 
by  c  =  logA?,  while  simultaneously  maintaining  a  linear  size  data  structure.  We  call  this 
version  fast  fractional  cascading.  This  result  improves  over  our  previous  result  in  [21],  which 
achieves  the  same  search  complexity  but  requires  non-linear  space.  We  will  first  revisit  the 
non-linear  space  solution  and  then  explain  how  to  reduce  the  storage  cost  to  linear. 
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3.2.1  Fast  Fractional  Cascading  with  Non-linear  Space 

We  augment  the  fractional  cascading  structure  described  in  Section  3.1  by  adding  two  types 
of  components  to  each  augmented  catalog  A(v).  First,  we  associate  c  additional  pointers 
Pi(g)}P2(g)i  ■  ■  ■  ,Pc(g)  with  each  element  g  in  A(v)  such  that  pi(g)  points  to  the  next  down- 
bridge  (possibly  g  itself)  connecting  v  to  uy,  where  uy  is  the  zth  children  of  v  from  the  left. 
Second,  we  build  for  each  up-gap  G(u^(h)  a  Q- heap  Q(h),  containing  elements  in  G(u^(h) 
with  distinct  values  (choosing  the  first  one  if  multiple  elements  have  the  same  value).  For 
large  enough  n  we  have  6c  —  1  <  log1/5  n\  and  therefore  Lemma  2.2  is  applicable.  We  have 
added  c  pointers  for  each  of  the  elements  in  the  augmented  catalogs,  whose  overall  size 
cannot  exceed  0(n).  In  addition,  a  global  look-up  table  of  size  0(n)  is  used  to  serve  all 
the  Q-heaps.  And  finally,  since  no  two  up-gaps  in  an  augmented  catalog  overlap,  since  they 
correspond  to  the  same  edge  in  T  (which  is  not  true  for  a  general  graph)  the  Q-heaps  cannot 
consume  more  than  0(n)  space. 

Now  suppose  we  have  found  g  =  a a(u)(x)  in  A(u).  Let  v  be  the  zth  child  of  u.  By 
following  the  pointer  pi(g),  we  can  reach  in  constant  time  the  next  down-bridge  in  u  and 
then  its  companion  up-bridge  h  in  v.  Using  Q(h),  we  can  find  the  successor  of  x  in  G(u^(h) 
in  constant  time. 

Lemma  3.2.  Let  c  =  O(\ogen).  The  fast  fractional  cascading  structure  described  above 
allows  the  identification  of  the  successors  of  a  given  value  x  in  the  catalogs  associated  with  F 
in  0(p  +  t(n))  time ,  where  t(n )  is  time  it  takes  to  identify  the  successor  of  x  in  L(w).  This 
structure  requires  O(cn)  space. 

3.2.2  Fast  Fractional  Cascading  with  Linear  Space 

We  partition  each  augmented  catalog  A(u)  into  p  =  |~|A(u)|/c]  blocks  .  .  . ,  Bp  each, 

except  possibly  the  last  one,  containing  c  elements.  For  each  block  Bi  starting  from  the  /th 
element  of  A(u),  we  construct  a  set  Ci  of  t  <  7c—  1  records  as  follows.  For  each  down-bridge 
g  that  is  the  dth  element  in  A(u),  where  l  <  d  <1  -\-7c  —  2,  we  include  in  Ci  a  record  r  that 
contains  two  entries  r.ptr  and  r.key.  The  entry  r.ptr  is  a  pointer  to  g,  and  r.key  is  the  key 
of  r  whose  value  is  defined  as  r.key  =  j  *  (7c  —  1)  +  (d  —  l)  if  g  is  associated  with  the  edge 
connecting  u  and  its  (j  +  l)th  child  (note  that  r.key  can  fit  in  a  word).  The  records  in  Ci 
are  sorted  in  increasing  order  by  their  key  values.  Now  let  g  be  the  successor  of  a  value  x 
in  A(u)  and  suppose  we  want  to  find  the  successor  of  x  in  the  augmented  catalog  associated 
with  the  ( j  +  l)th  child  v  of  u.  It  is  easy  to  determine  in  constant  time  the  block  Bi  to 
which  g  belongs  and  its  position  /  relative  to  the  starting  position  of  Bi  (f  =  0  if  g  is  the 
first  element  in  Bi).  If  g  is  itself  a  down-bridge  associated  with  (u,v),  then  we  are  done. 
Otherwise,  due  to  the  invariant  regarding  the  gap  size,  the  next  down-bridge  h  associated 
with  (u,v)  must  have  a  corresponding  record  in  Ci.  The  following  lemma  transforms  the 
problem  of  finding  h  to  a  successor  search  in  Ci. 

Lemma  3.3.  The  record  in  Ci  that  corresponds  to  h  is  the  successor  of  the  value  y  = 
j  ■  (7c  —  1)  +  f . 

Proof.  First  we  notice  the  fact  that  all  the  keys  of  the  records  in  Ci  are  distinct.  Let 
y'  =  j  ■  (7c  —  1)  +  /'  be  the  key  of  the  record  in  Ci  that  corresponds  to  h.  It  is  obvious  that 
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y  <  y' .  Now  let  y"  =  j"  ■  (7c  +  1)  +  f"  be  the  key  of  a  record  r  in  Ci  such  that  y  <  y" . 
We  only  need  to  show  that  y'  <  y" .  Since  both  f"  and  /  are  non-negative  integers  less  than 
7c  —  1,  the  fact  that  y  <  y"  leads  to  either  j  <  j",  or  j  =  j "  and  /  <  /".  If  j  <  j",  we 
immediately  have  y'  <  y" .  On  the  other  hand,  if  j  =  j" ,  then  the  record  r  also  corresponds 
to  a  down-bridge  associated  with  the  edge  (it,  v).  Since  h  is  the  leftmost  down-bridge  closest 
to  g,  we  have  f  <  /".  Thus  y'  <  y.  □ 

The  problem  of  finding  the  successor  of  an  integer  value  in  a  small  set  Ci  can  be  solved, 
again  using  the  Q-heap  data  structure.  The  following  straightforward  observations  ensure 
the  applicability  of  Lemma  2.2: 

•  \Ci\  <  log1/5  n  for  n  large  enough;  and 

•  The  total  number  of  distinct  keys  created  for  all  the  augmented  catalogs  is  bounded 
by  0(n). 

Finally,  it  is  easy  to  see  that  the  overall  additional  space  introduced  by  the  new  Q-heaps  is 
0(n),  and  thus  we  have  the  following  theorem. 

Theorem  3.1.  For  c  =  O(loge  n)  for  some  e  <  our  fast  fractional  cascading  structure 
allows  the  identification  of  the  successors  of  a  given  value  x  in  the  catalogs  associated  with  F 
in  0(p  -\-  t(n))  time,  where  t(n)  is  time  it  takes  to  identify  the  successor  of  x  in  L(w).  This 
structure  requires  0(n)  space. 


4  Orthogonal  Segment  Intersection 

Before  tackling  the  general  orthogonal  segment  intersection  problem,  we  develop  a  linear  size 
data  structure  to  handle  a  special  case  in  which  the  ^-coordinates  of  the  endpoints  of  the 
segments  and  the  query  segment  can  only  take  integer  values  over  a  small  range  of  values. 
We  will  later  show  how  to  use  the  solution  of  the  special  case  to  derive  a  solution  to  the 
general  problem. 

4.1  Modified  Vertical  Adjacency  Map 

Assume  that  the  x-coordinates  of  the  endpoints  of  each  segment  (ki,k2;y)  in  the  given  set 
R  of  n  horizontal  segments  can  take  values  from  the  set  of  integers  {1,2,...,  c} ,  where 
c  =  loge  n  is  an  integer,  and  furthermore,  assume  that  the  x-coordinate  k  of  the  query 
segment  r  =  (k;  z2)  is  an  integer  between  1  and  c.  Let  Y ( R )  =  (yi(R),  y2(R ), .  .  .  ,  yn’(R )) 
be  the  list  of  distinct  y-coordinates  of  the  segments  in  R  sorted  in  increasing  order. 

Our  overall  strategy  consists  of  augmenting  the  vertical  adjacency  map  with  auxiliary 
structures  so  that  we  will  be  able  to  identify  the  lowest  segment  in  R  intersecting  r  very 
quickly,  followed  by  progressively  determining  the  next  sequence  of  lowest  segments,  each  in 
0(1)  time.  The  details  of  this  strategy  are  described  next. 

Our  indexing  structure  D(R)  consists  of  two  major  components:  H(R)  and  M(R).  H(R) 
is  a  directed  vertical  adjacency  map  with  auxiliary  information  attached  to  it.  We  define  the 
direction  of  the  horizontal  edges  to  be  from  right  to  left  and  that  of  the  vertical  edges  to  be 
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from  bottom  up.  Note  that  we  do  not  require  that  each  face  of  H(R )  has  a  constant  number 
of  vertices  on  its  boundary. 

Each  vertex  of  H(R )  is  naturally  associated  with  a  pair  of  x,  y-coordinates.  We  call 
the  vertex  with  an  outgoing  horizontal  edge  a  tail.  We  augment  H(R)  with  three  types  of 
components  as  follows: 

•  For  each  distinct  y-coordinate  yj(R)  of  Y(R),  we  create  a  Q-heap  Qj(R)  to  index  the 
x-coordinates  of  the  vertices  whose  y-coordinates  are  equal  to  yj(R). 

•  For  each  integer  1  <  i  <  c  that  serves  as  the  x-coordinate  of  at  least  one  tail,  we  create 
a  list  Pi(R )  of  records.  Each  record  g  corresponds  to  a  tail  a  whose  x-coordinate  is  i 
and  contains  two  elements:  g.key}  which  is  the  y-coordinate  of  a,  and  g.ptr}  which  is 
a  pointer  to  a.  This  list  is  sorted  in  increasing  order  by  the  key  values. 

•  With  each  node  f3  we  associate  two  pointers  p(/3)  and  q(/3).  Let  yj(R)  be  the  y- 
coordinate  of  /3.  Then  p(/3)  points  to  the  Q-heap  Qj(R).  If  f3  is  not  a  tail,  q(/3)  is  null. 
Otherwise,  there  is  at  least  one  vertex  with  the  same  y-coordinate  as  f3  that  has  an 
outgoing  vertical  edge  and  is  to  the  strict  left  of  /3.  Let  7  be  the  rightmost  such  vertex, 
and  ei,  e2, .  .  . ,  e;  be  the  shortest  chain  of  vertical  edges  starting  from  7  such  that  the 
head  £  of  e;  has  an  incoming  horizontal  edge.  If  such  a  chain  exists,  then  q(/3)  points 
to  £.  If  not,  q(/3)  is  null.  Note  that  intuitively  q(/3)  is  the  top  left  corner  (if  it  exists) 
of  a  face  containing  /3. 

It  is  clear  that  H(R)  is  of  size  0(n). 

In  addition  to  H(R)}  we  have  a  bitmap  M(R)  consisting  of  a  list  of  bit-vectors.  Each 
vector  Vj(R)  corresponds  to  a  distinct  y-coordinate  yj(R)  and  contains  c  bits.  The  zth  bit, 
starting  from  the  most  significant  one,  is  set  to  one  if  there  is  a  vertical  edge  in  H(R)  passing 
through  the  point  (z,z/j(i?))  and  zero  otherwise.  Each  vector  can  easily  fit  in  a  single  word 
and  thus  the  storage  cost  of  M(R)  is  0(n).  These  vectors  are  aligned  with  the  lower  end  of 
the  words  and  are  stored  in  increasing  order  by  the  values  of  the  corresponding  y-coordinates. 

As  an  example,  Figures  5(a)  and  5(b)  illustrate  the  structures  H(R)  and  M(R).  In 
Figure  5(a),  the  dotted  lines  depict  the  c  possible  x-coordinates,  the  dashed  pointers  are  the 
^-pointers  that  are  not  null,  and  the  thick  line  represents  the  query  segment. 

Given  a  vertical  segment  r  =  (k;  27,  27),  we  first  identify  the  lowest  segment  that  intersects 
r  and  then  report  each  of  the  remaining  proper  segments  in  the  direction  of  increasing  y- 
coordinates. 

Locating  the  lowest  segment  that  intersects  r  is  performed  using  M(R).  Let  yj(R)  be  the 
smallest  ^-coordinate  greater  than  or  equal  to  27.  If  no  such  y  exists,  then  there  is  no  segment 
in  R  which  intersects  r.  Otherwise,  we  find  the  largest  value  i  <  k  such  that  the  zth  bit  in 
Vj(R)  is  one.  (This  number  always  exists  because  the  vertical  edges  whose  x-coordinates  are 
equal  to  c  form  a  infinite  line  and  therefore  the  lowest  bits  of  all  the  vectors  are  set  to  one.) 
This  can  be  accomplished  by  first  masking  out  the  highest  w  —  k  bits  of  Vj(R),  w  being  the 
number  of  bits  in  a  word,  and  then  locating  its  most  significant  bit.  In  [10],  Fredman  and 
Willard  describe  how  to  compute  the  most  significant  bit  of  a  word  in  constant  time. 

After  identifying  z,  we  use  Pi(R )  to  determine  the  record  g  with  the  smallest  key  larger 
than  or  equal  to  27.  We  can  then  immediately  obtain  the  vertex  a  pointed  to  by  g.ptr. 
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Figure  5:  H(R )  and  M(R). 

Lemma  4.1.  Let  (Ay ,ya )  be  the  coordinates  of  a.  Then  for  any  segment  (Ay,Ay;;y)  in  R 
such  that  Ay  <  k  and  ya  >  y  >  y i,  we  have  Ay  <  k.  That  is,  any  horizontal  segment  between 
y  ■ .  a n d  y  which  starts  to  the  left  of  r  ends  before  meeting  r. 

Proof.  I  lie  proof  is  by  contradiction.  Suppose  Ay  >  k.  We  then  have  Ay  <  Ay,  because 
otherwise  the  vertical  line  passing  through  a  would  have  had  at  least  one  vertex  a1  lying 
on  it  with  its  coordinates  (Ay/,;yQ,/)  satisfying  Ay,/  =  Ay  =  i  and  yai  <  ya,  which  contradicts 
the  way  we  chose  a.  Now  consider  the  vertical  line  passing  through  the  endpoint  (Ay,;y). 
Either  it  passes  through  the  point  (k2,yj(R))  or  intersects  a  horizontal  segment  whose  left 
endpoint  is  to  the  left  of  a  and  whose  y-coordinate  is  strictly  between  y  and  yj(R).  In  the 
first  case,  we  have  a  contradiction  because  there  would  have  been  a  more  significant  one-bit 
than  i  in  Vj(R).  In  the  second  case,  the  right  endpoint  of  that  horizontal  segment  has  to  beg 
to  the  strict  left  of  a,  following  the  same  argument  for  the  segment  (Ay,  Ay;  y).  By  repeatedly 
applying  this  argument,  we  can  show  that  either  there  is  a  one-bit  in  Vj(R)  more  significant 
than  the  or  there  is  a  record  in  Pi{R)  whose  key  is  smaller  than  ya  but  larger  than  jy, 
each  leading  to  a  contradiction.  □ 

Lemma  4.2.  Ifya  if  1J2,  then  the  horizontal  segment  t  =  (Ay,  Ay;  y)  on  which  a  lies  intersects 

r. 

Proof.  The  only  possible  scenario  in  which  t  does  not  intersect  r  is  when  Ay  >  k.  If  this 
is  the  case,  then  there  has  to  be  a  vertical  segment  (Ayyy^y^)  consisting  of  several  edges 
in  H  and  passing  through  the  point  (Ay,;y).  This  segment  cannot  cross  the  horizontal  line 
corresponding  to  Vj(R)  because  otherwise  there  would  have  been  a  more  significant  one-bit 
than  the  ?’th  in  Vj(R).  Therefore  there  has  to  be  a  horizontal  segment  t'  =  (k[,  kf,  y[)  with 
k2  >  Ay  >  k.  Lemma  4.1  implies  that  k[  >  k.  Repeating  this  argument  will  ultimately  lead 
to  a  contradiction.  □ 

Lemmas  4.1  and  4.2  show  that  the  horizontal  segment  t  on  which  a  lies  is  the  lowest 
segment  that  intersects  r.  Using  the  Q-heap  pointed  to  by  p{a),  we  can  find  the  vertex  (3 
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with  the  same  y-coordinate  as  a  and  the  smallest  x-coordinate  greater  than  or  equal  to  k. 
Since  t  intersects  r,  we  are  sure  that  f3  is  also  on  t.  The  following  lemma  explains  how  to 
iteratively  find  the  remaining  segments  that  intersect  r. 

Lemma  4.3.  Let  t  be  a  horizontal  segment  that  intersects  r  and  suppose  we  know  the  vertex 
f3  of  H(R)  on  t  with  the  smallest  x-coordinate  kg  larger  than  or  equal  to  k.  We  can  in  constant 
time  decide  whether  there  is  another  segment  t'  above  t  that  intersects  r ,  and  furthermore,  if 
there  is  one,  identify  in  constant  time  such  a  t'  having  the  smallest  y-coordinate  larger  than 
that  oft. 

Proof.  We  first  give  the  algorithm  to  compute  the  vertex  f3 '  on  t'  with  the  smallest  x- 
coordinate  kg>  larger  than  or  equal  to  k.  Consider  the  following  cases. 

Case  1  f3  has  an  outgoing  vertical  edge  e  and  k  =  kg. 

Case  1.1  e  is  an  infinite  edge,  i.e.  e  is  a  ray  shooting  upwards.  Then  there  are  no 
other  segments  intersecting  r. 

Case  1.2  The  edge  e  is  finite.  In  this  case,  the  vertex  f3 '  is  the  head  of  e  and  t'  is 
the  horizontal  segment  on  which  f3 '  lies. 

Case  2  f3  does  not  have  an  outgoing  vertical  edge  e  or  k  ^  kg. 

Case  2.1  q(/3)  is  null.  There  are  no  other  segments  intersecting  r. 

Case  2.2  q(/3)  is  not  null.  f3 '  corresponds  to  the  successor  of  k  in  the  Q-heap 

pointed  to  by  p(q(/d))  and  t'  is  the  horizontal  segment  on  which  f3 '  lies. 

We  now  show  the  correctness  of  this  algorithm.  We  only  discuss  Case  2,  as  the  correctness 
of  our  algorithm  for  Case  1  is  obvious.  First  consider  the  case  when  q(/3)  is  null.  Since  f3  has 
to  be  a  tail,  the  vertical  ray  starting  from  7  (introduced  in  the  definition  of  q(f3))  shooting 
upward  does  not  contain  a  vertex  with  an  incoming  horizontal  edge.  Hence  if  there  were  a 
horizontal  segment  above  t  that  intersects  r,  7  would  not  be  the  rightmost  vertex  to  the  left 
of  f3  that  has  an  outgoing  vertical  edge.  Hence  no  segment  above  t  intersects  r. 

We  now  consider  Case  2.2.  In  this  case,  7  and  the  chain  starting  from  it  always  exist.  Let 
e\ ,  e2, .  .  .  ,  e;  be  the  chain  of  vertical  edges  used  to  define  q(/3),  and  7  =  Si,  S2,  ■  ■  .  ,  h/+i  =  £  be 

the  sequence  of  vertices  such  that  for  each  1  <  j  <  /,  ej  =  (<hy ,  <hy+i ) ,  and  (C,j/7)  and  (kfy^) 

be  the  respective  coordinates  of  7  and  £.  We  claim  that:  (i)  no  horizontal  segment  whose 
y-coordinate  are  strictly  between  those  of  t  and  t'  intersects  r;  (ii)  the  horizontal  segment  t' 
on  which  £  lies  does  intersect  r;  and  (iii)  the  successor  f3 '  of  k  in  the  Q-heap  pointed  to  by 
p(q(j3))  always  exists. 

To  see  why  the  first  claim  is  true,  suppose  there  is  a  horizontal  segment  (k^k^y')  in¬ 
tersecting  r  that  satisfies  y1  <  y  <  y^.  Then  it  has  to  be  true  that  k'  <  k[  <  k.  Since 
we  are  discussing  Case  2,  there  has  to  be  another  horizontal  segment  {k'f^k'f^y")  such  that 
k'  <  k'f  <  k  and  y1  <  y"  <  y^.  Following  similar  arguments  as  in  the  proof  of  Lemma  4.1, 
we  can  show  that  either  there  exist  a  vertex  on  t  between  f3  and  7  with  an  outgoing  vertical 
edge,  or  there  exists  a  vertex  with  an  incoming  horizontal  edge  such  that  its  coordinate 
(£7/,  2/7)  satisfies  £7/  =  k'  and  y1  <  2/7  <  2/7  Either  case  leads  to  a  contradiction. 
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To  show  that  t'  indeed  intersects  r,  we  notice  that  the  right  endpoint  of  the  horizontal 
segment  of  which  the  horizontal  incoming  edge  of  £  is  a  part  cannot  be  to  the  (strict)  left 
of  5,  because  otherwise  either  there  would  be  a  chain  of  vertical  edges  closer  to  f3  than  the 
one  we  have,  or  there  would  be  a  horizontal  segment  lying  vertically  between  t  and  f  that 
intersects  r,  each  leading  to  a  contradiction.  This  also  justifies  the  last  claim  (iii),  and  the 
proof  of  the  lemma  is  complete.  □ 

Lemma  4.4.  Given  a  set  R  of  n  horizontal  segments  in  the  plane,  whose  endpoints  can  only 
have  c  =  loge  n  possible  x-coordinates  {1,2,...,  c},  it  is  possible  to  report  using  0(n )  space 
all  f  proper  segments  of  R  which  satisfy  a  query  r  =  (k;yi,y2),  where  k  =  1,  2, .  .  .  ,  c,  in 
0(t(n)  + /)  time,  where  t(n )  is  the  time  it  takes  to  compute  the  successor  of  y i  in  Z(R)  and 
Pi(R)  for  some  i  =  1,  2, .  .  .  ,  c. 

Note  that  we  can  apply  the  fusion  tree  technique  to  index  the  distinct  y-coordinates  using 
linear  space  so  that  t(n)  =  O(log  n/  log  log  n).  In  the  next  section,  we  will  show  how  to  use 
the  algorithm  of  Lemma  4.4  to  solve  the  general  orthogonal  segment  intersection  problem. 
By  applying  the  fast  fractional  cascading  technique,  The  time  t(n)  in  Lemma  4.4  can  be 
reduced  to  0(1)  except  for  the  initial  search,  in  which  t(n)  =  0(log  n/  log  log  n). 

4.2  Handling  the  General  Orthogonal  Segment  Intersection  Prob¬ 
lem 

In  this  section  we  consider  the  general  orthogonal  segment  intersection  problem  involving  a 
set  S  of  n  horizontal  segments.  To  simplify  the  presentation,  we  assume  that  the  endpoints 
of  the  segments  in  S  have  distinct  x-coordinates.  The  primary  data  structure  is  a  tree  T  of 
degree  c  =  loge  n,  built  on  the  endpoints  of  the  n  segments  sorted  in  increasing  order  of  the  x- 
coordinates.  Each  leaf  node  v  is  associated  with  c  endpoints.  Let  x\  and  xT  be  respectively  the 
x-coordinates  of  the  leftmost  endpoints  associated  with  v  and  the  leaf  node  to  its  immediate 
right  (ay  =  +oo  if  v  is  the  rightmost  leaf  node);  then  the  x-range  of  v  is  defined  as  [x\,xY). 
For  an  internal  node  u  with  c  children  v0,  rq, .  .  . ,  ry_i,  whose  corresponding  x-ranges  are 
[ay,  aq),  [aq,  ay), .  .  .  ,  [ay_i,  ay),  its  x-range  is  [ay,  ay).  The  set  of  c  —  1  infinite  horizontal  lines 
&i(rt),  b2(u ), .  .  .  ,  &c_i(rt),  whose  x-coordinates  are  aq,  ay, .  .  .  ,  ay_i  respectively,  are  called  the 
boundaries  of  u.  When  the  context  is  clear,  we  will  use  bfu)  to  represent  its  corresponding 
x-coordinate  as  well. 

The  segments  in  S  are  distributed  among  the  nodes  of  T  as  follows.  A  horizontal  segment 
is  associated  with  an  internal  node  u  if  it  intersects  one  of  the  boundaries  of  u  but  none  of 
the  boundaries  of  it’s  ancestors.  A  segment  is  associated  with  a  leaf  node  v  if  its  endpoints 
both  lie  within  the  x-range  of  v. 

The  set  S(v)  of  segments  associated  with  an  internal  node  v  is  organized  into  several 
secondary  data  structures  as  described  below  and  illustrated  in  Figure  6. 

•  The  c—  1  boundaries  of  each  node  v  are  indexed  by  a  Q-heap  so  that  given  an  arbitrary 
value  x  the  left  most  boundary  bi(y)  that  satisfies  x  <  bfv)  can  be  identified  in  constant 
time. 
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Figure  6:  Data  structures  for  the  segments  associated  with  node  v. 


•  With  each  boundary  bi(v)  with  1  <  t  <  c—  1,  we  associate  two  Cartesian  trees  Li(v)  and 
Ri(v).  The  Cartesian  tree  Li(v)  contains  the  endpoints  of  those  segments  (,?q,,r2 Vy) 
in  S(v)  which  satisfy  i(u)  <  aq  <  bi(v)  {bo(v)  =  — oo)  and  x2  >  bi(v),  and  is 
used  to  the  answer  the  three-sided  range  query  of  the  form:  (aq  <  a,  b  <  y  <  cl); 
and  Ri{v)  contains  the  endpoints  of  those  segments  that  satisfy  bi(v)  <  x2  <  bt+ i(u) 
(&i+i(r>)  =  +oo  for  i  =  c  —  f )  and  aq  <  bi(v),  and  is  used  to  answer  the  three-sided 
range  query  of  the  form:  (,r2  >  a,  6  <  y  <  cl).  Each  Cartesian  tree  thus  created  has  its 
nodes  doubly  linked  in  the  order  of  increasing  y-coordinates. 

•  Let  S'(v)  be  a  subset  of  S(v)  containing  segments  that  each  intersects  at  least  two 
boundaries  of  v.  We  organize  these  segments  using  the  data  structure  D(v)  discussed 
in  Section  4.1.  We  will  later  explain  how  to  transform  the  problem  corresponding  to 
S'(v)  to  the  one  discussed  in  Section  4.1. 

The  number  of  horizontal  segments  associated  with  a  leaf  node  is  at  most  c/2  since  there 
are  only  c  different  endpoints  associated  with  a  leaf  node,  which  are  simply  stored  in  a  list. 

We  analyze  the  storage  cost  of  the  structures  involved  in  our  overall  data  structure. 
Obviously,  each  segment  in  S  is  associated  with  exactly  one  node  v  of  T.  For  any  segment 
associated  with  an  internal  node  v,  it  appears  in  at  most  three  secondary  structures,  once 
in  Li(v)  associated  with  the  left  most  boundary  bi(v)  it  intersects,  once  in  Rj{v)  associated 
with  the  rightmost  boundary  bj(v)  it  intersects,  and  possibly  once  in  D(v).  Any  segment 
associated  with  a  leaf  node  is  stored  exactly  once.  Note  that  all  these  data  structures  are 
linear-space.  Hence  the  total  amount  of  space  used  by  these  structures  is  0(n).. 

We  next  outline  our  search  algorithm  and  then  fill  in  the  details  as  we  go  along.  Let 
s  =  (a;b,d)  be  a  vertical  segment.  To  avoid  the  tedious  but  not  difficult  task  of  treating 
special  cases,  we  make  the  assumption  that  the  endpoints  of  s  is  different  from  any  of  the 
endpoints  of  the  segments  in  S.  To  compute  the  set  of  proper  segments  in  Q,  we  recursively 
search  the  tree  T,  starting  from  the  root.  Let  v  be  the  node  we  are  currently  visiting.  We 
search  v  as  follows. 

1  If  v  is  a  leaf  node,  check  each  segment  associated  with  v  and  report  those  that  intersect 
s,  after  which  thq  algorithm  terminates. 


15 


2  If  x  lies  outside  the  x-range  of  v,  then  no  segment  in  S  intersect  s  and  the  algorithm 
terminates.  (This  can  only  happen  at  the  root,  when  s  is  to  the  left  of  all  the  segments 
in  S.) 

3  Otherwise  do  the  following: 

3.1  Find  the  pair  of  consecutive  boundaries  bi(v)  and  bi+i(v)  of  v  such  that  bi(y)  < 
a  <  bi+i(v).  (The  boundary  bi(v)  does  not  exit  if  x  <  &i(u);  and  bi+i(v)  does  not 
exist  if  a  >  bc_ i(v).) 

3.2  If  bi(y)  exist,  use  Ri(v)  to  report  segments  (aq ,x2;y)  that  satisfy  x2  >  a  and 
b  <  y  <  d. 

3.3  If  bi- ri (u)  exist,  use  Li+\(y)  to  report  those  segments  (aq,  x2]  y)  that  satisfy  x1  <  a 
and  b  <  y  <  d. 

3.4  If  both  bi(v)  and  bi+i(v)  exist,  use  D(y)  to  report  those  proper  segments  with  no 
endpoints  in  the  interval  (&4-(u),  &4-+i(u)). 

3.5  Recursively  visit  the  (i  +  l)th  child  of  v  (the  hrst  child  being  the  leftmost). 

The  correctness  of  the  algorithm  is  obvious,  provided  that  Step  3.4  can  be  performed 
correctly,  a  fact  we  will  show  shortly.  First  we  note  that  Step  3.1  can  be  done  in  constant 
time  using  the  Q-heap.  Furthermore  the  access  of  the  Cartesian  trees  in  Steps  3.2  and  3.3 
can  be  done  in  time  proportional  to  the  number  of  segments  reported  if  the  successor  of 
b  and  the  predecessor  of  d  in  the  list  of  nodes  for  each  Cartesian  tree  can  be  identified  in 
constant  time.  We  will  show  later  that  we  can  indeed  achieve  this  goal  by  applying  the  fast 
fractional  cascading  structure.  Finally,  it  is  clear  that  only  one  node  is  visited  at  each  level 
of  T,  which  consists  of  O(log  n/  log  log  n)  levels. 

Now  we  focus  on  Step  3.4.  The  difficulty  is  to  keep  the  size  of  D(y)  linear  and  at  the  same 
time  be  able  to  execute  this  step  in  time  proportional  to  the  number  of  segments  reported. 
Let  n'  be  the  size  of  S'{y).  One  obvious  choice  is  to  keep  as  D(y)  as  0(c2)  lists  of  segments. 
Each  list  corresponds  to  a  pair  of  boundaries  and  consists  of  segments  sorted  by  their  y- 
coordinates  that  cross  both  boundaries.  The  storage  cost  is  obviously  0{n').  However,  we 
will  have  to  visit  each  list  to  report  the  proper  segments,  since  there  is  no  obvious  way  to 
decide  beforehand  which  lists  contain  at  least  one  proper  segment  (as  Willard  cleverly  did  in 
the  design  of  the  fusion  priority  tree  [24]).  At  least  O(log2en)  time  seems  to  be  required  as 
a  result.  On  the  other  hand,  we  can  associate  with  each  pair  of  consecutive  boundaries  the 
sorted  list  of  segments  that  crosses  both  of  them.  This  approach  satisfies  the  requirement 
on  the  query  complexity  but  increases  the  storage  cost  by  a  factor  of  loge  n. 

We  now  present  our  solution  to  handle  these  segments.  We  hrst  transform  the  x- 
coordinates  x1  and  x2  of  the  endpoints  of  each  segment  s  into  two  integers  k\  and  k2  between 
1  and  c—  1.  More  specifically,  k\  and  k2  are  the  indices  of  the  leftmost  and  rightmost  bound¬ 
aries  of  v  crossed  by  s.  By  doing  this,  we  transform  S'{y)  into  another  set  W(v),  in  which  the 
segments  have  their  y-coordinates  unchanged  but  their  x-coordinates  replaced  by  the  indices 
of  the  boundaries.  At  query  time,  we  also  transform  the  query  segment  s  =  (x;yi}y2)  into 
another  segment  r  by  replacing  its  x-coordinate  with  the  index  k  of  the  boundary  to  its 
immediate  right.  It  is  straightforward  to  see  that  a  segment  in  S\v)  is  proper  if  and  only  if 
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its  corresponding  segment  (A’i,A’2;;y)  in  W(v)  satisfies  Ay  <  k  <  k-2  and  jq  <  y  <  y2.  (In  the 
case  where  Ay  =  A’,  the  original  segment  corresponding  to  (A’i,A’2;;y)  is  already  found  using 
L^iv)  and  thus  need  not  be  reported  here.)  We  now  have  exactly  the  problem  we  tackled 
in  Section  4.1.  Hence  by  Lemma  4.4,  we  can  find  the  f  proper  segments  in  W(v)  in  0(f) 
time,  provided  that  we  can  in  constant  time  identify  the  successor  of  b  in  the  various  sorted 
lists  of  y-coordinates  associated  with  H(v). 

To  complete  the  description  of  our  algorithm,  we  show  how  to  apply  the  fast  fractional 
cascading  structure  to  search  the  sorted  lists  at  different  levels  of  the  tree.  The  sorted  lists 
stored  at  each  node  v  consist  of  the  2 (c  —  1)  lists  Ri(v)  and  Li(v)  for  i  =  1, ...  ,  c  —  1,  the;- 
list  of  vectors  in  M(v),  and  up  to  c  —  1  lists  of  Pi(v).  Note  that  during  the  query  time,  we 
only  need  to  search  0(1)  such  lists  at  each  level.  Using  the  fusion  tree,  we  can  search  the. 
relevant  list  at  the  root  of  T  in  0(log  n/ log  log  n )  time. 

To  see  how  the  various  lists  are  linked  through  fast  fractional  cascading,  We  can  imagine 
a  virtual  forest  F  consisting  of  c  “virtual”  trees  Ti,T2,  .  .  . ,  Tc  of  degree  3c  —  2,  such  that 
the  lists  stored  at  the  roots  are  respectively  Lf  v),  L2( v), .  .  .  ,  Tc_i( v),  Rc-i( v),  where  v  is 
the  root  of  our  search  tree.  The  children  of  the  root  containing  Lfu)  contain  the  lists  in 
the  zth  children  of  u  from  the  left;  and  the  children  of  i?c_i(u)  are  the  lists  in  the  cth 
children.  Figure  7  illustrates  the  concept  of  the  virtual  forest.  It  is  straightforward  to  see 
that  a  node  in  F  is  searched  only  if  its  parent  is  searched.  Since  c  =  loge  n  with  e  <  1/5, 
3c  —  2  <  log1/5  n  for  large  enough  n.  Therefore  we  can  apply  the  fast  fractional  cascading 
technique  to  interconnect  the  lists  according  to  the  topology  of  the  virtual  forest  so  that 
we  can  search  in  constant  time  each  list  after  the  initial  search  at  the  root  of  F  without 
increasing  the  space  requirements. 


Figure  7:  Th§- .virtual  forest. 

In  summary,  handling  a  query  consists  of  processing  the  nodes  on  a  path  from  the  root 
to  a  leaf  node;  Processing  the  root  w  of  T  takes  0(log  n/ log  log  n  +  f(w))  time.  The  time 
spent  at  processing  any  other  internal  node  u  is  0(f(u)).  To  search  the  leaf  node  w,  we 
simply  check  each  segment  stored  there.  Since  there  are  at  most  0(c)  such  segments  and 
c  =  loge  n  <  log  n /  log  log  n  for  large  enough  n,  the  overall  query  time  is  (9 (log  n/  log  log  n ) 
and  therefore  we  have  the  following  theorem. 

Theorem  4.1.  There  exists  a  linear-space  algorithm  to  handle  the  orthogonal  segment  in¬ 
tersection  problem  in  0(log  n/ log  log  n  + /)  query  time f  where  f  is  the  n  umber  of  segments 
reported. 
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5  Rectangle  Point  Enclosure 

To  simplify  our  presentation,  we  assume  that  the  corners  of  the  rectangles  in  S  all  have 
distinct  x-  and  y-coordinates.  As  in  the  case  of  the  segment  intersection  problem,  the 
primary  data  structure  consists  of  a  tree  T  of  degree  c  =  loge  n.  Let  v  be  the  root  of  T 
and  &i(u),  b2(v)} ...,  &c_ i(u)  be  a  set  of  infinite  vertical  lines,  called  the  boundaries  of  u,  which 
partition  the  set  of  2 n  vertical  edges  of  the  rectangles  in  S  into  c  subsets  of  equal  size,  thereby 
creating  c  stripes  -Pi(n),  P2(v)}  •  •  •  5  Pc(v)-  We  define  the  c  subtrees  rooted  at  the  children  of 
v  recursively,  each  with  respect  to  the  vertical  edges  that  fall  into  the  same  stripe.  If  the 
number  of  vertical  edges  is  less  than  loge  n  in  a  stripe,  the  child  node  corresponding  to  this 
stripe  becomes  a  leaf  node.  Clearly  the  height  of  this  tree  is  O(log  n/  log  log  n).  A  Q-heap 
Q(y )  holding  the  boundaries  of  v  is  built  for  each  node  u,  which  will  enable  a  constant  time 
identification  of  the  stripe  of  v  the  query  point  belongs  to.  In  addition,  for  each  node  u, 
except  for  the  root,  we  define  its  x-range  as  the  the  stripe  Pi(u)  of  its  parent  it,  assuming  v 
is  the  zth  child  of  u  from  the  left. 

We  associate  with  each  internal  node  v  the  rectangles  that  intersect  at  least  one  of  its 
boundaries  but  none  of  the  boundaries  of  its  ancestors.  Each  leaf  node  contains  the  set  of 
rectangles  both  of  whose  vertical  edges  lie  within  its  x-range.  Hence  at  most  O(loge  n)  = 
O(log  n/  log  log  n)  rectangles  are  associated  with  a  leaf  node,  and  no  preprocessing  will  be 
required  for  these  rectangles. 

Now  consider  the  set  S(v)  of  rectangles  associated  with  an  internal  node  v.  As  in  [4], 
we  build  two  hive-graphs  HLi(y)  and  HRi( v),  as  defined  in  Section  2.3,  for  each  boundary 
bi(v),  i  =  1, ...  ,  c  —  1.  The  hive-graph  HLi(y )  is  built  on  left  vertical  edges  lying  inside 
stripe  Pi  and  is  used  to  answer  the  semi-infinite  segment  intersection  queries  of  the  form: 
(aq  <  x,yi  <  y  <  y2);  and  HRi(v)  is  built  on  the  right  vertical  edges  lying  inside  stripe 
Pi- |_i  and  is  used  to  answer  the  semi-infinite  segment  intersection  queries  of  the  form:  (x2  > 
xRJi  A  V  A  2/2)-  In  addition,  let  S'(y)  be  a  subset  of  S(y)  such  that  each  rectangle  in 
S'(y)  crosses  at  least  two  boundaries.  We  transform  the  coordinates  of  S'(y)  from  the  space 
3?  X  3?  to  W(v)  in  the  space  [1,  2, .  .  . ,  c  —  1]  X  3?  as  follows.  We  transform  each  rectangle 
[aq,  x2]  yi,  y2]  in  S\v)  into  the  rectangle  [£q,  k2;  iq,  y2\  in  W(v),  where  b^iy)  and  bk2(v )  are 
the  leftmost  and  rightmost  boundaries  it  crosses. 

We  now  turn  our  attention  to  the  query  algorithm  and  postpone  the  description  of  the 
data  structure  for  W(v)  until  the  end  of  this  section.  Using  the  Q-heaps  stored  at  the 
internal  nodes,  we  can  in  O(log  n/  log  log  n)  determine  the  path  from  the  root  to  the  leaf 
node  whose  x-range  contains  the  query  point  p  =  (x,  y).  (We  assume  for  simplicity  that  the 
x-  and  y-coordinates  of  p  are  different  from  that  of  the  corners  of  the  rectangles  in  S.)  It  is 
clear  that  only  the  rectangles  associated  with  the  nodes  on  this  path  can  possibly  contain 
the  query  point. 

Consider  a  node  v  on  this  path.  If  v  is  a  leaf  node,  we  simply  examine  each  rectangles 
associated  with  it,  a  process  that  takes  O (log  n/  log  log  n  +  f(v))  time,  where  f(v)  denotes 
the  number  of  rectangles  reported  at  node  v. 

If  v  is  an  internal  node,  we  first  decide  which  stripe  of  v  the  query  point  p  belongs  to,  a 
task  that  can  be  done  in  0(1)  time,  say  it  belongs  to  Pi(v).  The  rectangles  stored  at  node 
v  and  which  contain  p  can  be  classified  into  three  groups:  (i)  the  set  L(y)  that  contains 
the  rectangles  whose  left  vertical  edges  lie  inside  Pi(v);  (ii)  the  set  R(v)  that  contains  the 
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rectangles  whose  right  vertical  edges  lie  inside  Pi(v);  and  (iii)  the  set  F(v)  consisting  of  those 
rectangles  whose  horizontal  edges  cross  Pi(v)  entirely.  If  i  =  1,  R(v)  and  F(v)  do  not  exist. 
Similarly,  if  i  =  c,  L(y)  and  F(y)  do  not  exist. 

By  Lemma  2.4,  the  rectangles  that  belong  to  the  first  two  groups  can  be  identified  in 
0(1)  time  per  rectangle  reported  if  we  apply  the  fast  fractional  cascading  technique  on  the 
lists  of  the  sorted  y-coordinates  of  the  vertices  of  the  corresponding  hive-graph.  For  example, 
we  know  that  each  rectangle  (aq,  x2]  y2} 1J2)  associated  with  the  hive-graph  HLi(v)  satisfies 
x2  >  x.  Therefore,  to  find  rectangles  in  L(v),  we  only  need  to  check  the  criteria:  x1  <  x  and 
j/i  <  y  <  y.  Also  note  that  a  proper  rectangle  can  be  reported  at  most  once  in  this  process. 

The  remaining  task  is  to  determine  the  rectangles  in  group  F(v),  which  requires  an 
additional  data  structure.  We  start  from  the  set  W(v )  consisting  of  rectangles  of  the  form 
(£q,  k2;  j/i,  2/2),  where  k\  and  k2  are  integers  between  1  and  c  —  1.  For  each  pair  of  different 
integers  i  <  j  between  1  and  c—  1,  we  construct  a  cartesian  tree  Cij(v)  consisting  of  rectangles 
(z ,  j ;  2/1 , 2/2)  in  W(v )  to  answer  the  two-sided  range  queries  in  the  form  2/1  <  y  <  y2.  Note 
that  the  total  space  is  still  linear  and  the  use  of  the  fast  fractional  cascading  technique  will 
enable  us  to  access  the  appropriate  nodes  in  time  proportional  to  the  number  of  rectangles 
reported. 

However,  we  still  need  to  resolve  the  problem  of  identifying  which  of  these  Cartesian  trees 
should  be  accessed  when  handling  a  query.  We  cannot  afford  to  access  such  a  tree  unless 
we  are  guaranteed  to  find  at  least  one  proper  rectangle.  To  address  this  problem,  we  do  the 
following. 

We  construct  a  look-up  table  M(y)  with  n'  rows,  each  corresponding  to  a  distinct  y- 
coordinate  of  the  horizontal  edges  of  the  rectangles  in  W (u)  and  occupying  one  word  (of  log  n 
bits).  The  rows  are  sorted  by  increasing  order  of  the  2/-coordinates.  Let  yi(v)  <  y2(v)  <  ■  ■  ■  < 
yn'{y)  be  the  set  of  distinct  y-coordinates  of  the  horizontal  edges.  Let  Vj(v)  =  (bc 3,  bc3_1}  bi) 
be  a  sequence  of  c3  bits,  where  bi  is  the  zth  bit  from  the  lower  end  of  the  word  representing 
the  jth  row  of  M(y)  (note  that  c3  <  logra).  The  word  Vj(v)  is  evenly  divided  into  c  sections, 
each  corresponding  to  a  stripe  of  v  (actually  we  only  use  c  —  2  of  them  which  correspond 
to  P2(v)} .  .  . ,  Pc-i(v)).  Let  (bi+c2  bi+c2_1  ■  ■  ■  &;+1)  be  one  of  them  that  corresponds  to  Pi(v)} 
i  =  2, .  .  .  ,  c  —  1.  For  each  pair  of  integers  k\  <  i  <  k2  between  1  and  c  —  1,  we  set  the  bit 
bi+k1-c+k2  t°  one  if  there  is  a  rectangle  (£q,  k2l  y  1,  y2)  in  R  such  that  2/1  <  Vj(v)  <  y2.  All  the 
other  bits  are  set  to  zero. 

To  find  the  proper  rectangles  in  S\v)}  we  first  transform  in  0(1)  time  using  Q(v)  the 
query  point  (x,y)  to  the  point  (k,y)  in  the  same  space  as  W(v).  Let  bk(v)  be  the  leftmost 
boundary  of  v  whose  x-coordinate  is  greater  than  or  equal  to  x.  It  is  clear  that  a  rectangle 
(aq,  x2 ;  y  1,  y2)  in  S'{y)  contains  (x,  y)  if  and  only  if  its  corresponding  rectangle  (£q,  k2 ;  y  1,  y2) 
in  W(v )  contains  (&,  y).  Let  y3(v)  =  min{2/;(n)|l  <  l  <  n',  yi(v)  >  y}.  We  have  the  following 
lemma. 

Lemma  5.1.  Let  (&/+c2&/+c2_1  •  •  •  6;+1)  be  the  section  of  Vj{y)  which  corresponds  to  Pk(v). 
Then  for  each  pair  of  integers  1  <  F  <  k2  <  c  —  1  such  that  k\  <  k  <  k2,  biJrk1-c+k2  1  if 
and  only  if  there  exists  a  rectangle  (£q,  k2l  2/1,  y2)  £  W(v )  which  contains  (k,y). 

Proof.  By  the  dehnition  of  V)(u),  bi+k1.c+k2  =  1  if  and  only  if  there  exists  a  rectangle 
(£q,  k2 ;  2/1, 2/2),  such  that  2/1  <  Vj(v)  <  y2.  If  this  rectangle  indeed  exists,  we  have  k\  <  k  <  k2 
and  2/2  A  y2{v)  A  1J-  The  dehnition  of  y3(v)  ensures  that  y  >  j/i-  Therefore  (ki7k2;yi7y2) 
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contains  (&,  y).  Now  suppose  there  is  a  (Aq,  k2;  j/i,  y2)  in  IF (n)  which  satisfies  ki  <  k  <  k2  and 
contains  ( k}y ).  The  only  scenario  in  which  (Aq,  &2;  j/i,  j/2)  does  not  satisfy  jq  <  j/2(n)  <  j/2  is 
yj{y)  =  y  =  y\.  This  is  not  possible  given  the  assumption  that  y  can  not  be  the  y-coordinate 
of  any  horizontal  edge1.  □ 

Lemma  5.1  shows  that  the  section  B  of  Vj(v)  which  corresponds  to  the  stripe  containing 
(k}y)  indicates  correctly  the  Cartesian  trees  in  {Ci^\2  <  i  <  j  <  c  —  1}  which  should  be 
visited.  Using  a  look-up  table  of  size  0(n)}  similar  to  the  one  described  in  [21],  we  can 
transform  B  into  a  list  of  integers  (to,  A,  /2, .  .  . ,  /m),  where  to  is  the  number  of  1-bits  in  B 
and  Ii  is  the  index  of  a  unique  Cartesian  Cij(v),  for  /  =  1,  2, .  .  .  ,  to.  Then  we  simply  visit 
these  Cartesian  trees  one  by  one. 

Searching  the  sorted  lists  associated  with  non-root  nodes  can  be  done  using  fast  fractional 
cascading.  The  correctness  proof  and  complexity  analysis  for  this  part  is  similar  to  that  in 
Section  4  and  thus  is  omitted  here. 

Theorem  5.1.  There  exists  a  linear-space  algorithm  to  handle  the  rectangle  point  enclosure 
queries  in  O (log  n/  log  log  n  +  /)  time ,  where  f  is  the  number  of  segments  reported. 
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